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Abstract 

How should we gather information in a network, 
where each node’s visibility is limited to its local 
neighborhood? This problem arises in numerous 
real-world applications, such as surveying and task 
routing in social networks, team formation in col¬ 
laborative networks and experimental design with 
dependency constraints. Often the informativeness 
of a set of nodes can be quantified via a submod- 
ular utility function. Existing approaches for sub- 
modular optimization, however, require that the set 
of all nodes that can be selected is known ahead 
of time, which is often unrealistic. In contrast, we 
propose a novel model where we start our explo¬ 
ration from an initial node, and new nodes become 
visible and available for selection only once one of 
their neighbors has been chosen. We then present 
a general algorithm NetExp for this problem, and 
provide theoretical bounds on its performance de¬ 
pendent on structural properties of the underlying 
network. We evaluate our methodology on various 
simulated problem instances as well as on data col¬ 
lected from social question answering system de¬ 
ployed within a large enterprise. 

1 Introduction 

Pione ering work of Stanley Milgram in the 1960’s |Milgram~ 
1967| provided evidence that individuals in a social network, 
possessing very limited knowledge of the whole network and 
only being able to access their close acquaintances, can effec¬ 
tively route messages to distant target individuals in the net¬ 
work. Similar phenomena can be observed in many human- 
powered systems across a spectrum of examples, including 
the DARPA Red Balloon challeng^ and co-authorship net¬ 
works in academic communities. The challenges of local 
knowledge and limited visibility often arise in various diverse 
computing systems. This includes querying in peer-to-peer 
networks constrained by the decentralized network design; 
information gathering on the internet by web cra wlers or hu- 
mans via following chains of linked documents [White and 
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Singla, 2()TT| constrained by the inability to directly discover 
new documents; and users seeking experts via friends in on¬ 
line social networks owing to the privacy constraints. How 
can we build systems that autonomously explore networks un¬ 
der limited visibility for sake of information gathering? 

Information gathering. We formalize these information 
gathering tasks as actively identifying a set of nodes in a net¬ 
work that maximize a set function quantifying their informa¬ 
tiveness. Many natural objectives for this purpose satisfy sub- 
modularity, an intuitive diminishing returns condition (c.f., 
[Krause and Guestrin| ( |2Ql 1| )). Eor instance, in the social Q&A 
network (or web graph), the problem of finding experts in the 
network with the desired skills to answer the question (or the 
documents satisfying the informati on needs) can be cas t as 
submodular function maximization I jEl-Arini et al, 200^ . In 
the message routing problem in the Milgram’s experiment, 
the utility function can be modeled as reduction in distance to 
the target, measured as the minimal distance from one of the 
selected nodes to the target. 

Local visibility of the network. Existing approaches for 
submodular function optimization are based on the key as¬ 
sumption that the ground set (of all nodes) is known in ad¬ 
vance. With this assumption, greedy selection based on the 
marginal utilitie s of the nodes provides ne ar-optimal solutions 
to the problem [Nemhauser et al, 1978| . However, having 
access to the entire network is unrealistic in many real-world 
applications for various reasons. Eor instance, due to pri¬ 
vacy concerns, node visibility within social networks (such as 
Eacebook or Linkedin) is restricted to nodes we already con¬ 
nected to, and only these nodes can be target for routing tasks 
or posting a question. Even if the whole network is accessible 
(in a centralized system), the users may be more willing to re¬ 
spond and provide help to solve the task when routed through 
their acquaintances because of social incentives to help the 
peers and friends. The fundamental question is how to ex¬ 
plore the local neighborhood of the network with the goal of 
maximizing the utility function over the selected nodes. 

1.1 Overview of our approach 

We present a general approach to information gathering on 
networks under visibility constraints. In our model, the al¬ 
gorithm starts from an initial node (for instance, the individ¬ 
ual in a social Q&A network posting the question or seeking 
expertise for a task). New nodes become available for selec¬ 
tion only once one of their neighbors has been already cho- 




























sen. We model the local visibility constraints via limiting the 
number of hops within which the neighborhood of selected 
nodes becomes visible. Given this, the selection algorithm 
has to choose between exploring the network by selecting 
high degree nodes to expose new nodes and edges, or exploit¬ 
ing the currently accessible neighborhood by choosing nodes 
that provide maximal marginal utilities. Our main algorith¬ 
mic contribution is a novel algorithm NetExp for this prob¬ 
lem. We analyze its performance both in settings where no 
structural properties of the network are known, and in more 
specific settings capturing properties of real-world collabora¬ 
tive networks. Our main contributions are: 

• A formal model of information gathering in networks with 
local visibility constraints, capturing several real-world 
applications, such as task routing in social networks. 

• A novel algorithm NetExp for this problem that actively 
explores the accessible local neighborhood to increase vis¬ 
ibility of the unseen network, while at the same time 
exploiting the value of the information available in this 
neighborhood. We analyze the performance of our algo¬ 
rithm and provide theoretical bounds depending on the 
network structure. 

• Evaluation of our approach on data collected from a real- 
world application of task routing in a social Q&A system 
deployed within a large enterprise, to show the practical 
applicability of our methodology. 


2 Related Work 

Search and navigation with local knowledge. Our work 
is inspired by the ideas of navigation and sear ch in net- 
works with local knowledge. The seminal work of |Kleinberg| 
( |2000t[200^ addressed algorithmic questions of network for¬ 
mation and search strategies, to understand when short paths 
of acquaintances exists and are discoverable by local naviga¬ 
tion. In our work, we generalize the task of navigating to a 
target, to tha t of information gathering in a network. |Adamic| 
et al I ( |20Q1| ) and | Adamic and Ad'^ ( |2Q05| ) compared different 


local search strategies, for instance, following high-degree 
nodes or using proximity in an organizational hierarchy in the 
context of an email network. [Leskovec and Horvitz] ( |2Q14| ) 
tackled the challenge of navigating to a target in a geospatial 
network from a source using local knowledge of the network 
and proposed different navigation strategies - based on de¬ 
gree, or being closer in distance to the target, as well as com¬ 
bining these two strategies. Local search and team recruit¬ 
ing have also been studied in context of strategic participants 


via providing incentives jKleinberg and Raghavan, 2005 


Kota and Narahari, 2010| , which also formed the basis of re 
cruiting participants in the Red Balloon challengJ^. 

Task routing. [Richardson and White] ( |2011[ ) developed 
“IM-an-Expert”, a synchronous social Q&A system, where 
a user can pose a question via instant messaging that is 
then routed to an “expert” by the centralized system. Each 
posted question is characterized by the features and required 
skills; the users in the system have different expertise level, 
and the central system routes the question to the available 
experts. The main difference in our setting is the absence of 
any central authority, leading to the challenge of using local 
navigation to find the experts. More advanced variants of the 


social quer y model are proposed by [Horowitz and Kamvar 
( |2010[ ) and [Banerjee and Basil] ( [2008[ ), however t heir pro- 
posed models do not provide theoretical guarantees. [Jeong et 
[5^ ( |2013[ ) proposed “SearchBuddies” that route questions to 
friends in online networks, but did not study the algorithmic 
aspects of this routing. The challenge of locating experts in 
a decentralized ni anner has been studied [ [Nushi et < 2 /., 2015[ 
Xu et al, 2Q10[ , however, no form al analysis has been 


provided for the proposed techniques. [Zhang et al\ ( 2012| ) 
considered task routing for prediction tasks, by designing 
routing-based scoring rules and studying their truthfulness 
and efficiency in the equilibrium. However, we are not 
focusing on strategic aspects of the participants, and rather 
the algorithmic aspects of information gathering in networks. 

Explore-exploit dilemma. Many reinforcement learning 
and online learning problems are associated with explore- 
exploit dilemma, and several solutions use the framew ork of 
multi-armed bandits (MAB) [ Lai and Robbins, 1985 [. Moti- 
vated by applications in social advertisement, Eang and Tao 
( [2Q14[ ) considers social-MAB where the actio ns or arms are 
the us ers of the underlying social network. Bnaya et al 
( [2Q13[ ) considers the application of social network search and 
targeted crawling by using MAB framework. One of the key 
differences in our work is that the “exploration” refers to ac¬ 
quiring access to more nodes in the network that were previ¬ 
ously inaccessible, and is very different from the usual notion 
of “exploration” used in MAB literature. 

3 Problem Statement 

We now formalize the problem addressed in this paper. 

Set of nodes and the network. We consider a set of nodes 
{e.g., a population of people or users of a system) denoted by 
set V = {^ 1 , t’ 2 ,..., of size \ V\. There is an underly¬ 
ing network over the nodes, denoted by G = {V^E). Eor now 
we will not make any assumptions about its structure. Instead 
we discuss this further during the analysis, as our bounds 
will depend on the specific assumptions that we assert. 

Task and the utility function. We denote a task as T, 
associated with an initial node v^. Eor instance, in a social 
Q&A system, the task T could be a question, and is the 
user posting the question. We associate with each task T a 
function over the set of nodes, given by /t : 2^ ^ M, quan¬ 
tifying their informativeness/expertise. Given a set of nodes S 
selected for task T, the utility achieved from this set is given 
by friS). We assume each /t to be non-negative, monotone 
(i.e., whenever S' C S" C y, it holds that /(S) < f{S')) 
and suhmodular. Submodularity is an intuitive notion of di¬ 
minishing returns, stating that, for any sets S C S' C V, and 
any given node v ^ S', it holds that /(S U {^}) — /(S) > 
/(S'U{^}) — /(S'). These conditions are general, and are 
satisfied by many realistic, as well as complex utility func¬ 
tions for information gathe ring [ Krause and Guestrin, 201 ![ 
Krause and Golovin, 2012| . 

Local visibility and connectivity of selected set. We seek 
algorithms for the setting where the network is revealed in¬ 
crementally as more nodes are selected. We denote the /-hop 
neighborhood of a node v as A/’(^, /) C y, to be the set of all 
nodes in the network that are connected to v either directly or 
via at most (/ — I) intermediate nodes. Eor simplicity, we shall 







































































Figure 1: Illustration of local visibility constraints for I deg = 
2 , lyai = 2 . The currently selected set is given by S = 
{vi,V 2 ,V 3 }- The 1 -hop neighborhood of S, given by set 
is now accessible to the algorithm for selec¬ 
tion. Given lyai = 2, the algorithm can evaluate / to 
compute marginal utilities for the set {i; 4 , 

Given I deg = 2, the 1-hop neighborhood of the nodes 
{^ 4 , ^ 5 , ^65 ^ 7 , ^85 ^ 9 } is visible to the algorithm in order to 
find the network they expose. For instance, the algorithm can 
compute that node vj exposes the set of nodes {vq^ ^ 11 , ^ 12 } 

assume that v is also included in N'{v, 1). For a set of nodes S, 
we define its /-hop neighborhood as N'{S, 1) = /). 

Since the network is unknown in the beginning, we seek an 
algorithm that incrementally selects nodes always within the 
1 -hop neighborhood of already selected nodes. 

In order to make informed decisions, we will assume that 
some information is being revealed to the algorithm while 
nodes are being selected. In particular, assuming we have al¬ 
ready selected nodes S, for every node v in N{S, I deg), its 1- 
hop neighborhood N{v,l) is visible. This essentially means 
that the algorithm has sufficient information in order to find 
the new network nodes that will be exposed if v is added to the 
current set. We furthermore assume that the objective func¬ 
tion / can be evaluated for every node v in N{S, Ivai)- The 
constants I deg and lyai will be specified later. Figure [^illus¬ 
trates this with a simple example for I deg = 2 and lyai = 2. In 
real-world social networks (such as Facebook or Linkedin), 
the visibility is usually restricted to Ideg = 1 and lyai = 1 due 
to privacy settings. 

Optimization problem. Consider a task T originating at 
any node and / its associated utility function. We seek to 
select a set of nodes S of value at least Q in terms of utility. 
The process starts from the initial node and sequentially 
adds new nodes maintaining the connectivity of S. We are 
interested in designing an algorithm Alg that has only Ideg 
and lyai visibility, and achieves Q while minimizing the cost 
of the selected set. Formally, we define the optimization prob¬ 


lem as follows: 

S'* =argmin |S'| ( 1 ) 

scv 

s.t. f{S) > Q ( 2 ) 

S is connected in G (3) 

eS (4) 


Algorithm Alg has Ideg^ Uai visibility (5) 

Note that due to constraint < 0 , an optimal solution to this 
problem is in fact not a fixed set, but a policy that specifies 
which next node to select as a function of the information 
revealed. We also define a fixed optimal solution set VOpt 
that achieves value of Q, not subject to any computational 
constraints nor the constraints given by ([^, 0 and 0 . 


4 Our Algorithm NetExp 


We begin by describing the high level ideas behind our algo¬ 
rithm NetExp and then provide details. 

4.1 Overview of basic approach 

To present some of the key insights in designing NetExp, let 

us first consider two extreme settings. 

Special case: fully connected network. Let us first 
consider the extreme setting where the network graph G is 
fully connected. This, in turn, makes the constraints 0, 
0 and 0 for the optimization problem as posed in Sec¬ 
tion redundant. In this case, it r educes to an instance of 
standard submodular optim ization i Nemhauser et al, 1978[ 
[Krause and Guestrin, 201 1| . Ignoring the computational con¬ 
straints, the size of the fixed optimal solution set is given by 
I VOpt I , as defined in Section Obtaining this solution is 
intractable, however a greedy selection based on the marginal 
utilities of the nodes can provide a solution of size bounded 
by 0(1 VOpt I • ln(^)) to achieve a utility of (1 — /3) • Q where 


p indicates a tolerance level. If / is integral, any ^ > XjQ 
suffices to ensure complete coverage (i.e., exact value Q). 

Building the network backbone. Now suppose the net¬ 
work is arbitrary. One approach would be to first build a con¬ 
nected “backbone” of the network by selecting a set of nodes 
S that covers the whole set V, i.e., A/’(5', 1) = Once 
this “backbone” is built, the whole network is visible and 
all the nodes are accessible to be selected by the algorithm. 
Subsequently, the problem again reduces to that of standard 
submodular optimization. This “backbone” is called a con¬ 
nected dominating set of the graph, and the size of minimal 
connected dominating set is given by 7 ^. Computing such a 
minimal connected dominatin g set is intractable, but app roxi- 
mation algorithms are known |Guha and Khuller, 1998| . One 
naive approach to Probleml^is hence to first build a complete 
“backbone”, following by greedy selection of nodes based on 
marginal values. This approach guarantees a feasible solu¬ 
tion, but it is obviously undesirable for large networks where 
IG might be very large. Furthermore, there may be some high 
value nodes closer to the initial node, and hence building the 
complete dominating set may be unnecessary. 

4.2 Explore-exploit dilemma 

The two basic extreme settings highlight two key regimes that 
any policy must be able to handle: Exploit an exposed (sub)- 
graph by selecting nodes to achieve value Q, and explore an 
unknown graph to expose nodes for selection. Our algorithm 
will interleave these steps. 

Exploration of the network. One natural choice for 
adding a node to the current set is to select one that pro¬ 
vides maximal “exposure” of the unconnected network. The 
“exposed” neighborhood of the selected set S' is the / = 1 
neighborhood excluding S, formally given by A^(S, 1 ) = 
M{S^ 1) \ S. Assuming we have already selected a set S, 
adding a new node v exposes an additional \JV{S U 1) \ 

N'{S, 1 ) I nodes that becomes available to the algorithm to be 
selected. This node v may not provide immediate value, how¬ 
ever, the additional connections in the newly exposed network 
might be useful, potentially helping to discover and connect 
with the required high valued nodes. We call this greedy, 
value agnostic, choice “exploration”. 









Algorithm 1: Algorithm NetExp 


1 Input: 

□ Task: T; Initial node: Utility function: /; 

□ Local visibility: {Ideg^kai}', 

□ Exploration rate: e; Value required: Q; Tolerance: I3\ 

2 Output: selected set S'; 

3 Initialize: S = i = 1; e* = e; 

4 while f{S) < {I - p) ’ Q do 

5 if \N{S U {v}, 1)| = |A/'(5,1)|) then 

6 I e* = 0; > {i.e., M{S, 1) = V) Update e* 

7 With prob. e% a* ^ Explore; else, a* ^ Exploit; 

8 if a* = Explore then 


9 


n*=argmaxns:;e[i.. 

10 


5 

^ = 5un*; 

11 


if Ideg = 1 then 

12 



Randomly pick v* ( 

13 



S' = S'UK}; 


else 


14 


n* = argmaxns.;g[i 

15 


6 

' = 5un*; 

16 

i 

= i + 1; e® = e® 


•^de 


|Ar(sunf,i)\Ar(s,i)| 

I 


fiSUUf)-fiS) . 


17 Output: S 


Exploitation of value. The other natural choice of select¬ 
ing the next node is to add the one that provides immediate 
value to the set in terms of maximal marginal utility value. In 
context of an already selected set S, adding a new node v pro¬ 
vides a marginal gain in value, given by f{S U {i;}) — f{S). 
We call this choice “exploitation”, as it greedily maximizes 
utility given the currently exposed network. 


4.3 Algorithm NetExp 

Interleaving network exploration with exploitation. The 

key idea is to interleave the two choices of network explo¬ 
ration and exploitation. While the exploration step will con¬ 
tinue towards building the connected dominating set (that we 
need in the worst case anyways as per results in Theorem [^, 
the exploitation step will greedily select the nodes to maxi¬ 
mize the information gathered and terminate the algorithm as 
soon as the desired level of utility is achieved. Our algorithm 
NetExp is illustrated in Algorithmic It turns out that this 
simple approach allows us to derive tight theoretical bounds 
on the performance, and it also runs quite efficiently for vari¬ 
ous problem instances. NetExp interleaves exploration and 
exploitation with e probability, as illustrated in Step of Al¬ 
gorithm [C The e is constant for the procedure and is pro¬ 
vided as input. If further prior information about the network 
properties or the distribution of features is available, this pa¬ 
rameter can be tuned (for instance, whether to do more ex¬ 
ploration or more exploitation based on such properties). In 
Algorithmic we simply use a constant e, and when the full 
network is exposed (or dominating set for the network is al¬ 
ready built), we set e = 0 (see Step|^. 

Look-ahead during exploration As noted above, the ex¬ 
ploration steps of the algorithm aim towards building the 


connected dominating set. Negative results from Guha and 


Khuller| ( [19981 ) show that to effectively build connected dom¬ 


inating sets, adding one node at a time based on the crite¬ 
rion of maximal “exposure” is not sufficient, as such a greedy 
approach may need upto U(|U|) nodes to build a connected 
dominating set. Our ide a is based on the intuition used by 
|Guha and Khuller| ( 1998| ) on how to effectively add upto two 
nodes, when I deg > 2, in a way to be able to build efficient 
connected dominating sets. We generalize the idea of one- 
step look ahead for Ideg = 2 to that of doing a / — 1 step look 
ahead for Ideg = I during exploration. To formalize this, we 
introduce the concept of a chain of length 1. Consider the cur¬ 
rent selected set S. We define an /-chain from S, denoted by 
nf as an ordered set satisfying the following constraints: 

nf eV(5,i); 


nf eV(5unf,i)\V(5,i); 

eSf{s unf_i, 1) \X{s unf.^,i); 


In Step IC all possible chains of length 1 to / are enumer¬ 
ated to find the set of nodes to add at every iteration. While 
only considering chains is not required for the analysis, it re¬ 
stricts the search space. Nevertheless, the number of chains 
to consider is exponential in the lookahead. However, this is 
not an issue for practical settings where / < 2 usually. 

Look-ahead during exploitation The similar idea of look¬ 
ahead as used above is also employed during the exploitation 
step, and is done in Step 14 This look-ahead can also be 
thought of as moving in the direction of the utility function’s 
(discrete) “gradient”. _ 

The case of Ideg = !• Recently, |Borgs et al\ (|2012|) intro¬ 
duced a simple randomization technique to efficiently build 
up connected dominating sets when “look-ahead” is not pos¬ 
sible, i.e, for the case Ideg = 1- Specifically, after a node v 
is added, another randomly selected node from its neighbor¬ 
hood A/'(l’, 1) that is newly exposed is also added. We use the 
same trick in NetExp, in Step|^of Algorithmic 


5 Performance Analysis 

We now analyze the performance of NetExp. The proofs 
and details are avail able in the extended version of the paper 
i Singla et al, 2015| . 

5.1 Analysis for general settings 
We first analyze NetExp for general settings, without any 
assumptions on the network structure or the utility function. 
We will state our results in terms of two important network 
properties: i) the maximum degree of any node in the graph 
G, which we denote as A^, and ii) smallest siz e of any con¬ 
nected dominating set as defined in Section [TT] We start 
with analyzing the setting where {Ideg = 2, Ivai = 1) (similar 
results hold for Ideg ^ 2, lyai > 1 as well): 

Theorem 1. For Ideg = % Ivai = NetExp terminates 
with set S achieving utility of at least (1 — /3) • Q, satisfying 
the constraints of and •ID for Problem |7] with the 

following bound on the size of S in expectation (over the coin 
flips), given /^y E[|5'|] < 

(k (2 + 2 In(AG)) + (^ ■ IVOPTi ■ ln(i)) 

Common social networks only satisfy Ideg = 1 {i^e., nodes 
can see their friends’ friends). Eor this more challenging case 
we can still prove the following (slightly weaker) result: 





























Theorem 2. For I deg = 1. Ivai = 1,NetExp terminates 
with set S achieving utility of at least (1 — /3) • Q, satisfying 
the constraints of 0 . <0 and 0 for Problem with the 
following bound on the size of S in expectation (over the coin 
flips) that holds with probability at least 1 — given by 

E[|5|] < (i ■ (4 + 2 ■ In(AG)) -ih) + (■ I VOpt| -111(1)) 


The proof of Theorems [^andj^involves separately analyz¬ 
ing the sequence of explore and exploit actions during the ex¬ 
ecution of the procedure, and ensuring that interleaving these 
sequences preserves the analysis. 

For the general setting, we also have the following lower 
bound, showing that the dependency on 7 ^ is unavoidable. 
Theorem 3. For any bounded Ideg cind l^ah there exists a 
problem instance for which any feasible policy will need to 


select a set of size at least |5'| > max 


(7&, |VOPT|) 


The proof of Theorem follows from the arguments of 
Section |4.1| and by crafting a worst-case instance of graph 
structure and distribution of node values for any given policy. 


5.2 Analysis for realistic settings 

We now analyze the results for the specific setting motivated 
by real-world collaborative networks such as co-authorship 
networks in academic communities. 

Feature distribution and network structure. The nodes 
are associated with subsets of a feature set X of size \X\. 
Considering the specific domain of co-authorship networks, 
a feature x e X could, e.g., be an indicator variable denot¬ 
ing whether a user has published a paper in an AI conference. 
For each feature x G A', let the set of nodes possessing that 
feature be given by I 4 . We consider each feature x e X 
as a “social dimension” that induces a network given by 
G:r, = (Vx, Ex). In this model we assume that eac h network 


Gx is formed by a preferential attachment process iBarabasi 
and Albert, 1999| leading to a power law distribution of the 


degrees. The final network that we observe is obtained by an 
overlay of these networks, given by G = UxexGx. For a 
node V, we denote the value of its feature x G X as and is 
proportional to the rank order dictated by the degree of node 
V in graph Gx • This notion of feature values essentially cap¬ 
tures the “authority” of a node over a particular feature (for 
instance, a node gains expertise in “AI” if it has high degree 
of connections for this particular network). 

Characteristics properties of utility function. For a 
given task T, we consider a separable utility function given 
by: f{S) = ^xex where denotes the weight 

of function /^. Here, the function depends only on nodes’ 
features x. The maximum value of function f^{S) = 1 is 
achieved after including a top valued node with feature x. 

Theorem states the main result for these settings, and 
is proved using the res ults on the navigation properties o f 
“small-world” networks |Bollobas, 2003 Borgs W al, 2012| . 
Compared to the general settings of Theorem [IJ and the 
“small-world” networks are much easier to navigate as can 
be seen in the polylogarithmic bounds. 


Theorem 4. Consider Ideg = Ivai = 1 ^nd a task T origi¬ 
nating from user possessing a feature x^. With probability 
of at least 1 — o(l), NetExp terminates with set S achieving 


utility of at least (1 — P) • Q, with the following bound on the 
size of S in expectation, given Z?y E[|5'|] < 

(f 0(ln4(|T4o|))) + (^-0( ln"(|V;|) + |VOpt|)) 

ccGX 

Next, we state the lower bound in Theorem which fol¬ 
lows from the expected diameter of the graph obtain ed from 
preferential attachment |Bollobas and Riordan, 2004| . 
Theorem 5. For any bounded Ideg lyai, there exists a 
problem instance for which any feasible policy will need to 

select a set of size at least E[|5'|] > O ) 

6 Experimental Evaluation 

We now report on the results of our experiments. 


6.1 Experimental Setup 

Benchmarks and Metrics. We compare the performance of 
our procedure NetExp against the following three baselines: 
i) Random is a trivial baseline that randomly selects users 
from the exposed network to add to the set;; ii) Deg is equiv¬ 
alent to running NetExp with e = 1; and Hi) Val is equiva¬ 
lent to running NetExp with e = 0. The primary metric that 
we compare is the utility achieved as a function of the size 
of the selected set S. We further carried out experiments by 
varying e, varying the distribution of node values and com¬ 
paring the size of the exposed network. 

Utility function and features. We consider a feature set 
X of size \X\ associated with the nodes V, and denote the 
value of feature x G X for node v e V sls Xy. For a given 
task T, we consider separable utility functions per feature and 
can write f{S) = ^^7 ^xex ' /®(5'), with weights 

given by G {0,1}. We consider a probabilistic notion of 
utility derived from the feature values and define a submodu- 


lar utility function per feature f^{S) = (^1 — 

inspired by El-Arini et al. |( |200^ . 

Implementation choices. We considered a realistic setting 
of Ideg = 1 and lyai = 1 visibility. For running different al¬ 
gorithms, we considered value quota (5 = 1, with tolerance 
of P = 0.05. We used e = 0.5 for NetExp across all of 
the datasets without any further tuning. For the implementa¬ 
tion of the NetExp, we report results where we skipped the 
Step of Algorithm of adding a random user after every 
exploration step for Ideg = 1 case. While this step is useful 
to handle the worst case scenarios of the graph configuration 
to achieve desirable bounds, for practical purposes, it did not 
provide any additional benefit or performance gain. 


6.2 Datasets 

We performed experiments with three different datasets. 

Erdos-Renyi random graph. We created random graphs 
using the random graph model of Erdos and Renyi| ( |1959| ), 
with ground set of size \ V\ = 1000 and the probability of an 
edge existing between any two nodes given by Pedge = 0.01, 
a threshold probability above which the graph is connected 
with high probability. We then considered a total of 5 features 
(i.e., \X\ = 5), distributed i.i.d. across the nodes with prob¬ 
ability given by pyai = 0.001. Note, that this probability is 
essentially equivalent to having one non-zero valued node in 
the set per feature on expectation. For these non-zero valued 
nodes per feature, the actual values of the features are then 
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uniformly sampled and scaled to lie in the ra nge (0,1]. We 
also vary 'p^ai used by results in Figure |2(e)| We generated 
a set of the tasks, each associated with 3 required features 
out of 5 (i.e.. 3 functions have non-zero weights w^'). Dis¬ 
tribution of tasks is generated by a random sampling of the 
required features, as well as the initial user per task. 

Preferential attachment graph per skill. As a second 
data set, we used settings similar to those discussed in Sec¬ 
tion 5.2 We considered a ground set of size \ V\ = 100,000 


i d) Exposed net work (e) Varying fraction of experts (f) Varying e for NetExp 

2(a) 2(b) and 2(c)| shows the utility acquired as the size of the selected set S increases. Results show 
robustness of NetExp for different network structures and feature distributions. For the Erdos-Renyi graph model. Figure [2^ 
shows the e xposed network for different algorithms and Figure [2^ shows the results of varying the proportion of valued nodes. 
Figure [2^ shows the results of varying the e for NetExp, showing the robustness of the NetExp compared to explore-only 
(e = 1) and exploit-only (e = 0) methods. 

6.3 Results 

Utility acquired w.r.t set size. Eigure |2(a)|2(b)|2(c)| illus¬ 
trates the utility acquired as the specified upper bound on the 
selected set S is increased, showing the robustness of Net¬ 
Exp. The difference in the rate at which the utility is acquired 
for the random graph (Eigure 2(a)| ) and preferential attach¬ 
ment graph models (Eigure |2(b)| ) is in line with the theoreti¬ 
cal results we had studied for worst-case as well as scale-free 
networks in Theorem and Theorem respectively. The 
results on the real-world IMX dataset in Eigur^2^ shows a 
large positive gap in acquired utility for NetExp compared 
to the baselines. Note that the properties of the IMX dataset 
are very different from the two synthetic datasets, since the 
users are embedded in an organizational tree hierarchy. 

Exposed netw ork a nd varying feature distribution. The 
results in Eigure |2(d)| shows that NetExp also maintains a 
good balance of the exposed network in comparison to Val 
and to Deg {i.e., specifically optimizing for maximizing this 
exposure). Eigure |2(e)| compares the different algorithms 
(with \S\ = 50), as the proportion of valued nodes in the 
network are increased in the first dataset. 

Varying e for NetExp. Eigure [2^ shows the results for 
NetExp with varying e on different three datasets, by re¬ 
stricting |5'| to 100, 100 and 200 respectively. The results 
show the robustness of NetExp compared to explore-only 
(e = 1) and exploit-only (e = 0) methods. The results plot 
the utility acquired divided by the size of selected set, (i.e., 
f(^)/\s\), normalized to the same scale across datasets. 

7 Conclusions and Future Work 

We formalized the process of selecting nodes in a network for 
information gathering, limited by local knowledge and visi¬ 
bility of the network. Our model captures many real-world 
applications that have been previously studied only under the 
assumption that the underlying network and complete set of 


and total of 10 distinct features (i.e., \X\ = 10). The features 
are distributed i.i.d. with probability 0.2 (i.e., on the expec¬ 
tation, every node is associated with two features). The dis¬ 
tribution of the tasks is generated similar to the first dataset, 
with three randomly selected relevant skills per task. 

Real-world social Q&A system. As a third study, we used 
a real-world dataset from a synchronous social Q&A system 
named IM-an-Expert (IMX), which routes incom ing ques 


tions to candidate ans werers via instant messaging I Richard- 
son and White, 20rT| . We obtained usage data for this sys¬ 


tem, from a deployment inside a large enterprise over a period 
of two years (May 2012 to April 2014). During that time, 
the system was used by about 5,000 subscribed users (the 
“experts”), embedded in the network defined by the organi¬ 
zational tree hierarchy of over 180,000 nodes. These “ex¬ 
perts” each have a profile that had been inferred from their 
homepages, email distribution list subscriptions, and a self- 
described set of keywords describing their expertise. Users 
can pose a question in IMX via text, for example. Excel: How 
do I set default pivot table to ''Classic ” ?. Eor each such ques¬ 
tion, the system assigns an expertise score to the subscribed 
users, in the range (0,1], based on matching the content of 
question keywords against their profiles. While the deployed 
IMX system operates in a centralized manner with questions 
being routed to any expert, we simulated the system as only 
being able to locate experts using local search. 





































































































nodes is known and accessible in advance. We developed 
a general algorithm NetExp that provides a simple way to 
trade-off between network exploration and the exploitation 
of value in the exposed neighborhood. We proved strong the¬ 
oretical guarantees on its performance, and evaluated our ap¬ 
proach on several simulated instances as well as using data 
collected from a real-world social Q&A system. We see sev¬ 
eral interesting directions in which the presented work can 
be extended. We assumed that the utility of a node can be 
observed perfectly. Extending the proposed approach for the 
case of noisy/stochastic function evaluations would be practi¬ 
cally useful. Designing algorithms that adjusts the rate of ex¬ 
ploration e and take into account for some prior information 
available about the network structure and utility distribution 
is an interesting direction for future work. 
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A Proof of Theorem 1 

Proof of Theorem [D Let us denote the coin toss of the algorithm at each iteration by a random variable Yi and its outcome as 
i/i G {0,1}, with Hi = 1 corresponding to the action Explore and yi = 0 corresponding to the action Exploit. We shall 
separately analyze the sequence of Explore actions and the sequence of Explore actions. The selected set Si denotes the 
current set of nodes selected by the algorithm at the end of iteration i. 

Analyzing the Explore actions: Let us first analyze the Explore actions of the algorithm. The Explore actions of 
the algorithm aim towards building the connected dominating set of t he graph. These actions s imulate the greedy procedure 
of building connected dominating set with local I deg = 2 visibility in |Guha and Khuller, 1998|, however with following two 
differences: 

1. The Explore actions are interleaved with the Exploit actions that can add some nodes to the selected set, and hence 
need to be taken into account during the analysis of building connected dominating set. 

2. NetExp decides how many users to add in one round based on the following 


n* = 


arg max 

Uf:le[l...ldec 




I 


Eor the case of Ideg = 2, this essentialy means that the two nodes are added simultaneously, i f their average exposure is 
higher than the exposure of the single node. This is in contrast and optimal compared to the i Guha and Khuller, 1998| 
where two nodes are added as long as their joint exposure is higher than the single node. 

With these two differences, we analyze the execution of these Explore actions, and prove below that a connected 
dominating set is built with size bounded by (2 + 21n(AG')). Given that in expectation, only ^ exploration steps are taken, 
there is additional factor of - that appears. 


'G — ^ denotes the minimal connected dominating set of the graph, of size ygr. Let 2 ; G 


Dq denote one of the nodes 


Let D\ 

in this set, and each such node 2: covers a set of the nodes in V denoted by the set If a node in V is covered by multiple 
nodes in Dq, we randomly assign it to one of them. This way, {V^}\/ z in Dq creates a partition of ground set V into disjoint 
sets. Next, we associate colors with the nodes in V as follows: i) all the nodes are intitally white, ii) whenever a node is added 
to the current set, it is turned black. Hi) all the newly exposed nodes of current set, i.e., immediate neighbors are tur ned gray. 

We analyze the total size required to build connected dominating set using the charging analysis, similar to i Guha and 
Khuller, 1998| . Each time, a node is added to the set {i.e., turned black), we charge by distributing this cost of increasing the 
set size to the newly exposed nodes {i.e., those turned gray). This charging is only done when = 1. We then bound the total 
charge per partition for 2 ; G Dq. Note that, a node is charged only once as a node turns gray only once and hence the sum 
of these charges gives us the total number of the black nodes added (which is same as the size of the connected dominating set 
we would like to bound). 

Let us now analyze the total charge that is placed on the nodes of . We denote the number of white nodes in at iteration 
i by the notation \ V^\i. Note that, initially, all the nodes are white {i.e., = |1/^|) and \ V^\i eventually drops to zero once 

all the vertices in are turned gray or black. Consider the iteration ig, when for the first time, any node in has changed 
color from white to gray color. Before i^, all the nodes in remain white. At this iteration ig, up to two nodes in V turned 


\V^\iz number of nodes in have turned gray. The maximum charge that is applied on these nodes is given 

(charge would be less, if only one node is turned black, i.e., numenator would be 1, or charge would be less, if 


black, and \ V^ 
by: \v-\-\v-u^ 

additional nodes are turned gray in other partitions, i.e., demoninator would be higher). 

After this initial ig iteration for this set, consider any iteration j. Now, the node 2 ; is itself available to be selected (as part of 
the chain of size two), and it would expose atleast a set of size \ V^\j-i. Hence the total exposure of the nodes (one or two) that 
are selected by the procedure in this iteration must be higher than this. This implies that if any node is turned gray from in 
this iteration, the charge is bounded by ^ . 

The total charge that is placed on the nodes in set can be bounded as follows, taking into account that a charge is placed 
only for the explore actions {i.e., y = 1): 


2-(|F-|-|V-|,g) ^ ^ 2 ■ - \V^\d 

/lT;^rl \jr^\ \ ' ^3 / j 


° {\v^\-\v^k) 


< 2 + 2/j • 2 • 


<2+2.y] 
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n-i 




















< 2 + 21n(AG) 

In second and third steps above, the binary variable y is replaced by the value of 1. Now, given that there are total of 7 g 
such partitions, hence the total charge that can be placed (equivalent the total number of black nodes selected) is given by 

(2 + 21n(AG)) •7&. 

Analyzing the Exploit actions: Let us consider the phase when the connected dominating set is already built {i.e., there 
are no more remaining white nodes). In the worst case setting, if the selected set doesn’t provide the required utility yet, from 
this phase onwards, all the nodes are directly accessible to the algorithm as it would be for a centralized algorithm with full 
visibility and without any connectivity constraints. The Exploit actions of the algorithm aim towards max imizing the set 
function as in a standard greedy approach of submodular function maximization ( |Krause and Golovin, 2012| ), however with 
following two differences: 

1. The Exploit actions are interleaved with the Explore actions that can add some nodes to the selected set. However, 
the greedy algorithm for submodular function maximization is oblivious to this addition of other nodes and can be ignored 
from the analysis. 

2. The algorithm has already selected a set of nodes by the time this phase starts (when connected dominating set is built). 
However, again, the analysis of the greedy algorithm for submodular function maximization is oblivious to the starting set 
that may have already been selected by the procedure. 

With these two differences, the execution of these Exploit actions, starting from the phase when connected dominating set 
is built, leads to the bound of | VOpt| • In(^) on the number of additional nodes that could be added to the set until the required 

utility value is achieved. Given that in expectation, only exploitation steps are taken, there is additional factor of that 
appears. In fact, a tighter analysis can be done because of the following optimization trick in the algorithm that, when the full 
network is exposed (or dominating set for the network is already built), we set e = 0 (see Stepof Algorithm [^. □ 

B Proof of Theorem 2 

Proof of Theorem [21 The proof of this theorem follows exactly along the same arguments as that of Theorem However, 
for I deg = 1, the Explore action has a ra ndmozization step involved as discussed in Section]^ To recall, as shown in the 
seminal work of |Guha and Khuller, 1998|, a simple deterministic procedure for I deg = 1 that add users greedily based on 
maximizi ng the “exposure” may have worst case cost of |I2|. Recently, this barrier of I deg = 1 has been resolved by |Borgs et 
al, 2012| using a simple randomization technique. Specifically, after a user v is added, another randomly selected user from 
her neighborhood JViv. 1) that is newly exposed is also added. This is illustrated in the StepM^of AlgorithmMl 


Khuller, 1998| and follows similar techniques for the analysis. Our results for this theorem follows along the same analysis 
as done in proof of TheoremWe separately analyze the Explore and Exploit actions, ensure that interleaving these two 
actions doesn’t affect the analysis, and then combine these two actions by multiplying the two terms in the bounds by ^ and 
respectively. □ 


In fact, the analysis of this new randomizied technique proposed in jBorgs et al, 2012]rTs inspired from the iGuha and 


C Proof of Theorem 3 


Proof of Theorem [3j The theorem states the lower bound for the general settings considered in Section 5.1 in the worst case. 
Eor any bounded I deg and lyai , there exists a problem instance for which any feasible policy will need to select a set of size at 


least |5'| > max ^ 7 ^, IVOptI^ 

The proof of Theorem [^follows from the arguments of Section [TT] and by crafting a worst-case instance of graph structure 
and distribution of node values for any given policy. It is clear that any algorithm will require a set of size at least | VOpt|, by 
the definition of VOpt from Section Recall that VOpt is the algorithm operating without any computational constraints, 
with global visbility and without any contraints on connectivity of the set. We shall now show that the dependency on 7 ^ is 
unavoidable by creating a worst case problem instance for any given algorithm as follows. 

Consider a ground set V of nodes. All the tasks originate from a fixed node Vq eV. All the nodes V posses 0 utility, except 
a special valued-node Vx possessing utility of 1. Hence, the goal of any algorithm is to locate Vx starting from Vq. Consider 
any deterministic algorithm which adds the nodes to the set in a particular order. We split the execution of this algorithm in 
two phases. Phase 1 consists of the set of nodes which are added to the set, until a connected dominating set is formed and 
let us denote this set of nodes added to the set as Si . Phase 2 consists of the set of nodes which are added after a connected 
dominating set is formed, and we denote this set of nodes as S2 . The final set output by the algorithm is given by S = Si U S2. 
Eor this algorithm, we pick the valued node Vx G V \ Si. In order words, for any algorithm, a worst case distribution of the 
node values can be set up, such that the single valued node Vx cannot be located without constructing the connected dominating 
set. This gives us the worst case lower bound of 7 ^ for any determinisitc algorithm. Similarly, for any randomized algorithm 
or policy, it can be shown to have a dependency on 7 ^, with probability bounded away from zero. □ 



















D Proof of Theorem 4 

We begin by providing more details of the model introduced in Section [5^ and then proving Theorem]^ 

D.l Details of the model for realistic settings 

Skill distribution. The nodes are associated with subsets of a features from the set X of size \X\. Considering the specific 
domain of co-authorship networks, a feature x e X could, e.g., be an indicator variable denoting whether a user has published a 
paper in an AI conference. For each feature x G T', let the set of nodes possessing that feature be given by 14 and corresponding 
probability by Ox = . We assume that the features are independently distributed. Under independence assumption, we can 

state that if we take any node v and consider its m outgoing links, then the probability that at least one of nodes from these m 
links poses feature x is at least (1 — (1 — Ox)^). 

Characteristic properties of the graphs. We consider each feature x G T' as a “social dimension” that induces a network 
given by Gx = (Vx,Ex). In this model we assume that each network Gx is formed by preferential attachment process 
fBarabasi and Albert, 1999| , where nodes ^ G arrive over time, and each node on arrival forms rrix links to existing nodes 
with probability proportional to the degree of those nodes. This network leads to power law distribution, often seen in the 
collaborative networks such as co-authorship graphs or online social network. The final network that we observe is obtained 
by an overlay of these networks, given by G = {JxexGx- For a node v, we denote the value of its feature x e X SiS Xy 
and is proportional to the rank order dictated by the degree of node v in graph Gx, scaled to have bounded support of (0, !]• 
This notion of feature values essentially captures the “authority” of a node over a particular feature (for instance, a node gains 
expertise in “AI” if it has high degree of connections for this particular network). Note that, we consider undirected networks, 
and hence incoming or outgoing degree is same. 

Characteristics properties of utility function. We now characterize the utility function properties. For a given task T, we 
consider a separable utility function given by: f{S) = ^xex where denotes the weight of function /^. The 

weights are normalized, i.e., ^xex ~ Here, the function depends only on nodes’ features x, and its value is bounded 
between [0,1]. There are set of “irrelevant” features for which task T carries no utility, i.e = 0. For other “useful” features 
for which > 0, function carries positive, non-zero utility for every node possessing non-zero value of that feature. The 
maximum value of function f^{S) = 1 is achieved after including a top valued node with feature x. This is motivated by real- 
world settings where the goal is essentially to find few experts for each of the required skills for a task (such as a collaborative 
project in academia). 

Characteristics properties of VOpt. For a given task, consider the function associated with any specific useful feature x 
for which > 0. As discussed above, then the maximum value of function f^{S) = 1 will be achieved after including a top 
valued node with feature x. Hence, VOpt contains one top valued node for each of the features for which > 0. 


D.2 Proof 

Proof of Theorem m Th eorem[^is proved using the results on the navigation properties of “small-world” networks iBollobas, 


2003 [ Borgs et al, 2012| . Compared to the general settings of Theorem and the “small-world” networks are much easier 


to navigate as can be seen in the polylogarithmic bounds. 

Consider a task T originating from node possessing feature x^. Let the goal is to find at least one top valued node for each 
of the features with > 0 . Theorem [^states the main results for these settings, that is, with probability of at least 1 — o(l), 
NetExp (with I deg = 1, Ival = 1) terminates with set S achieving utility of at least (1 — /3) • Q, with the following bound on 
the size of S in expectation. 


E[|5|] < (1.0(ln4(|T4„|))) + (^.0( ^ \n\\VS + |VOpt|)) 

x^X 

The main ideas behind the proof of the above theorem are illustrated in Figure and discussed below. For the ease of putting 
forth the ideas, let us first consider only one useful feature/skill. 

Reaching high degree nodes in the graph Gxo. The first term in the summation • 0(ln^(|I4o|))^ is attributed to 
the process of identifying high degree nodes among the nodes possessing feature x^. Let us denote one such high degree 
node as r^^o. The exploration step of the algorithm will navigate to the node within 0( ln^(|I4o |)) steps such that the 

{txo) > (where Ag{v) denotes the degree of vertex v in the graph G). This follows from the navigation 

properties of the preferential attachment model of networks jBorgs et al, 2012| . Consider the process of navigating the graph 
Gx for a feature x, starting from any node with feature x. If we follow the maximum degree node in the I deg = 1 local visibility, 
we reach a node rx within O (ln^(|I41)) steps such that {tx) > * Aq^ , this holds with probability at least 1 — o(l) 

(where the o notation is w.r.t to size of the graph). Furthermore, with probab ility at least 1 — o(l), the maximum degree of any 
node in Gx, denoted by Aq^ is bounded by Ag, < -111(1141) ( iBollobas, 20^ ). 
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Figure 3: Illustration of the keys ideas of navigating to high valued nodes in the specific settings of preferential attachment 
graphs. Each triangle here represents a graph associated with a feature, with higher degree nodes of that feature located towards 
the top of the triangle. In this example, there are total of three features, and one useful feature shown by the green triangle. 

Given that in expectation, only ^ exploration steps are taken, the first term represents the bound on the number of nodes that 
could be added to the selected set towards reaching r^o . 

Jumping to some nodes with useful features. Then, the exploitation step would ensure that we can make a jump from 
to the nodes with useful features (i.e., > 0), with high probability. Consider a useful feature x, given the independence 

assumption of the feature distribution, the algorithm can jump to nodes with feature x, with probability at least: 1 — (1 — 
and this is 1 - o(l). 

Reaching high degree nodes with useful features. Both the exploration and exploitation step would help with this process. 
However, the exploration step may still continue adding higher degree nodes in the graph Gxo or other features th at do not 
provide any further utility, hence the second term in bound of Theorempjhas a factor of Based on the results of I Borgs et 
al, 2012| , consider starting from any node and navigating graph Gx, if we follow the maximum degree node in the I deg = 1 
local visibility, we reach a small constant number of high degree (1 + /^) nodes, denoted by within 0(ln^(|I4|) + k) 
steps such that Ac^ivx) > in^(\v |) ' for ^ Rx^ this holds with probability at least 1 — o(l). As per the power law 

distribution of the degrees, this set of (1 + /^) highly-skilled nodes are within top I — P percentile top rank for the feature 
X, with probability 1 — o(l). 

When considering only one useful feature, putting these ideas together provide us the main results of Theorem for these 
settings, which holds with probability of at least 1 — o(l). The same ideas also carry over when there are multiple useful 
features. This follows from the fact that once 1 + high valued nodes of one particular feature have already been added to 
the set, then the nodes of this feature will not provide any further marginal gain, and hence the exploitation step will jump to 
nodes of another useful feature that is required by the task. The second term in the statement of theorem above indeed needs to 
be summed only over the useful features. This summation over all the features X in the second term represents the worst case 
setting when all the features are useful and required by the task. 

In our specific problem setup in Section [DT] and the results Theorem]^ the utility function requires only one top valued node 
per useful feature, hence hz can be set to 0 in the above bounds. □ 


E Proof of Theorem 5 

Proof of Theorem O Next, we state t he lower bound in Theorem!^ which follows from the expected diameter of the graph 
obtained from preferential attachment |Bollobas and Riordan, 2004] r 

T he th eorem states the lower bound for the specific model considered in Section [5^ with further details pr ovided in Sec¬ 
tion M. The results follow from the expected diameter of the graph obtained from preferential attachment |Bollobas and| 
Riordan, 2004| . We can create the problem instance as follows. Let there be only one unique feature x (i.e., \X\ = 1 and let all 
the nodes V possess this feature, i.e., V = Vx with all the nodes possessing this feature. All the tasks originate from nodes for 
which the degree is one. 

The expected diameter of the graph obtained from the preferential attachment process is given by O ^ 




^ _ ^lnln(|y|); 

and Riordan, 2004| . Any algorithm or policy, (even if it has full knowledge of the network, i.e., with unbounded I deg 


Bollobas 


^_ _ vai) will 

have the expected size of selected set at least that of the diameter as it represents the minimal path length to connect the nodes 
from where task originates (those with degree of one in this problem instance) to the highest valued node. □ 



























